Estimation of regression quantiles in complex surveys with data missing at random: An application to birthweight determinants.

نویسنده

  • Marco Geraci
چکیده

The estimation of population parameters using complex survey data requires careful statistical modelling to account for the design features. This is further complicated by unit and item nonresponse for which a number of methods have been developed in order to reduce estimation bias. In this paper, we address some issues that arise when the target of the inference (i.e. the analysis model or model of interest) is the conditional quantile of a continuous outcome. Survey design variables are duly included in the analysis and a bootstrap variance estimation approach is proposed. Missing data are multiply imputed by means of chained equations. In particular, imputation of continuous variables is based on their empirical distribution, conditional on all other variables in the analysis. This method preserves the distributional relationships in the data, including conditional skewness and kurtosis, and successfully handles bounded outcomes. Our motivating study concerns the analysis of birthweight determinants in a large UK-based cohort of children. A novel finding on the parental conflict theory is reported. R code implementing these procedures is provided.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Estimation of E(Y) from a Population with Known Quantiles

‎In this paper‎, ‎we  consider the problem of  estimating E(Y) based on a simple random sample when at least one of the population quantiles is known‎. ‎We propose a stratified estimator of  E(Y)‎, ‎and show that it is strongly consistent‎. ‎We then establish the asymptotic normality of the suggested estimator‎, ‎and prove that it ...

متن کامل

Kernel Estimation of Distribution Functions and Quantiles with Missing Data

A distribution-free imputation procedure based on nonparametric kernel regression is proposed to estimate the distribution function and quantiles of a random variable that is incompletely observed. Assuming the baseline missing-at-random model for nonrespondence, we discuss consistent estimation via estimating the conditional distribution by the kernel method. A strong uniform convergence rate ...

متن کامل

Design-Based Estimation for Geometric Quantiles

In this paper, we are interested in estimating geometric quantiles when data are obtained in a complex survey. Geometric quantiles defined by Chaudhuri (1996) are an extension of univariate quantiles in the multivariate set-up that uses the geometry of multivariate data clouds. A very important application of them is the detection of outliers in multivariate data through quantile contours. This...

متن کامل

The Effect of Education on Labor Wages in Iranian Urban Households Based on Quantile Regression

The purpose of this article is to examine the impact of education and work experience on earning. For this purpose, Mincer’s wage equation, quantile regression estimation method and the microdata from Iranian survey of household income and expenses in 2016 have been used. Estimation results show that education returns are positive in all income quantiles, and education in lower-income quantiles...

متن کامل

Imputation methods for quantile estimation under missing at random

Imputation is frequently used to handle missing data for which multiple imputation is a popular technique. We propose a fractional hot deck imputation which produces a valid variance estimator for quantiles. In the proposed method, the imputed values are chosen from the set of respondents and are assigned with proper fractional weights that use a density function for the working model. In addit...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Statistical methods in medical research

دوره 25 4  شماره 

صفحات  -

تاریخ انتشار 2016